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Abstract 

The discovery that the number of physically consistent string vacua is on the order of lO^*"^ has 
prompted several statistical studies of string phenomenology. Contained here is one such study that 
focuses on the Weakly Coupled Free Fermionic Heterotic String (WCFFHS) formalism. Presented 
are systematic extensions of the well-known NAHE (Nanopoulos, Antoniadis, Hagelin, Ellis) set of 
basis vectors, which have been shown to produce phenomenologically realistic models. Statistics 
related to the number of [/(l)'s, the specific gauge groups and their factors, non-Abelian singlets, 
and spacetime supersymmetries (ST SUSYs) are discussed for the full range of models produced. 
These findings are compared with prior results of other large-scale investigations. Statistical cou- 
pling between the gauge groups and the number of ST SUSYs is also discussed. In particular, 
for order-3 extensions there is a correlation between the appearance of exceptional groups and 
enhanced ST SUSY. Also discussed are some three-generation GUT models found among the data 
sets. These models are unique because they come from basis vectors which still have a geometric 
interpretation - there are no "rank-cuts" in these models. 
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I. INTRODUCTION 



Physically consistent string/M-theory derived models number at least approximately 
IQ500 [2j fjj^g large number of vacua has prompted both computational and analyti- 
cal examinations of the the "landscape" of possible string vacua, e.g. pl-fTT]. The Weakly 
Coupled Free Fermionic Heterotic String (WCFFHS) [T2HT5] approach to string model con- 
struction has produced some of the most phenomenologically realistic string models to date 
[T6H57] . Outlined in this section are the basic steps needed to construct models within this 
formalism. Also detailed are the methodology of systematic searches and the difficulties they 
pose, and an explanation of the NAHE set of basis vectors used to construct realistic string 
models. Finally, results of the preliminary order-2 and order-3 extension to the NAHE set 
are presented, with an emphasis on GUT group models. 

A. Weakly Coupled Free Fermionic Heterotic String Model Building 

The heterotic theories consist of closed strings which have independent sets of left and 
right moving modes. The left moving modes are ten dimensional superstrings, while the 
right moving modes are 26 dimensional bosonic strings. WCFFHS models are constructed 
by specifying the phases that fermion modes gain when parallel transported around non- 
contractible loops on a genus- 1 world sheet of a string. The number of possible values these 
phases may take is referred to as the order, while the number of "basis vectors" of phases 
used to specify the model is referred to as the layer. The linear combinations of the basis 
vectors produce sectors from which the physical states of the model are built. The force and 
matter content is then determined from the symmetries of these states. 

Modular invariance highly constrains the possible components of the basis vectors. A 
consistent set of basis vectors {<S^} is such that: 



where Nij is the least common multiple of the orders of the basis vectors cJf and , and 
the dot product is Lorentz, with a relative minus sign between the left-moving and the 
right-moving contributions. Additionally, 




(mod 8) (for even Ni), 



(mod 4) 



(1) 
(2) 
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The number of simultaneous periodic modes for any three basis vectors must be even. 

Taking all possible linear combinations of the basis vectors builds the full "geometry" of the 
model, designated by sectors {a}. States from a sector a are designated by their charge 
vectors Qs, defined by 

Q^ = ^d + F, (3) 

where F is all combinations of 0,±1. The massless (with regard to the Planck scale) states 
are such that 

Ql = 2, (4) 
Ql = 4, (5) 

in a real fermion basis. In a complex basis, the values are halved. A physical state in a 
model a state must survive all GSO projections (GSOPs) from basis vectors af. These 
require that 



Qd = ^ dikji + Sj (mod 2), (6) 



where Oj are the coefficients on the basis vectors used to construct a, Sj is (1) for bosonic 
(fermionic) basis vectors , and kji is an element of the GSO coefficient matrix. 
The GSO matrix must also follow its own modular invariance constraints, given by 

Njkij = (mod 2), (7) 

kij + kji = -df ■ af (mod 2), and (8) 
2 ■' 

hi + kii = ^df ■ df - Si (mod 2). (9) 



B. Challenges to Systematic Searches 

Systematic searches are designed to do a "complete" scan of the possible consistent sets 
of basis vectors and GSO coefficients that define models within a chosen parameter space. 
In the discussions herein, models are classified into the order (the number of available twists 
for right-moving modes) and the layer (the number of basic vectors). Searches with random 
sampling have already been performed [58146T] . However, random sampling methods have 
several difficulties which are non-trivial to address [61]. Systematic methods avoid these 
inherent difficulties because models are not "sampled" - they form a complete set. Described 



in this section are the motivations and challenges to systematic searches of WCFFHS models 
including model building speed, model data storage, and model comparison. 

C. The Scale of Systematic WCFFHS Searches 

The scope of the systematic WCFFHS model searches can be limited by several factors. 
The first, and most obvious, factor is that of model building speed. Because the number 
of models to be built can get quite large, minimization of construction speed is essential. 
Assuming the model building process takes around 1 second, a data set with 1,000,000 
models would take about 28 hours. As the number of "layers" of basis vectors in a model 
increases, the number of total models to build grows geometrically; each of the 1,000,000 
models now has on the order of 1,000,000 new basis vectors (likely more than that) with 
modular invariance. Thus, the total number of models in the data set of the next layer is 
an estimated 10^^. At one model every second, the layer-2 data set would be completed in 
approximately 3000 years. 

Several steps can be taken to reduce the time required for model construction, both from 
an analytical and implementation standpoint. From the implementation side, understanding 
optimization techniques when writing the model building software is essential. Minimizing 
file and screen i/o, as well as the amount of copying done in memory are crucial steps needed 
to reduce the computing time. 

Another computational concern is one of data storage. While a "master atlas" of basis 
vectors and the models to which they map is desirable, it is not feasible. The estimated 
average amount of space per model is 0.725 kB. For the 1,000,000 model data set, this 
amounts to 725 MB. The next layer would use approximately 725 x 10^ TB, making a 
pure "atlas" of WCFFHS models unreasonable to pursue. There are two solutions to this 
problem. One is to gather statistics as the models are generated without keeping the models 
themselves in memory. While this approach is less taxing on system resources, it docs have 
disadvantages. In particular, one must know the statistics to be gathered on the models 
prior to runtime. Any additional statistics would require a second run. Moreover, the total 
model set would have to be statistically analyzed, as opposed to the distinct models only, as 
the models are not being kept in memory. Double-counts that result from the construction 
method itself cannot be avoided with this approach. The other approach is to count the 
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distinct models only, writing those to a file that serves as a small repository. While a full 
discussion of uniqueness in WCFFHS models is discussed in the next section, it suffices to say 
there are disadvantages to this approach as well. In particular, the definition of uniqueness 
in these models can be somewhat nebulous at times. Additionally, each unique model that is 
found must be compared with the other unique models that have been constructed, making 
each model require more computing time to complete. 

Analytic techniques can also be used to reduce the number of redundant models produced. 
As these techniques are related to uniqueness in WCFFHS phenomenology, discussion of such 
analysis will be deferred until after uniqueness has been addressed. 

D. Uniqueness in WCFFHS Models 

The previously described computational limitations force the analysis to be only on mod- 
els that are considered distinct. However, the definition of distinctness amongst WCFFHS 
models is not always clear. In particular the "amount" of phenomenology done to distin- 
guish the models from one another must be balanced with the amount of computing time 
required to perform such analyses, as well as how easily the analyses can be automated. 

Consider two different basis vector sets producing the same gauge and non-Abelian matter 
content. Even though the actual charge vectors and gauge group eigenvalues for the two 
models may be different, the overarching group structures are identical. This could be 
due to a string-scale type symmetry within the construction method itself, in which case 
there would be some transformation that could be applied to one of the basis vector sets to 
reproduce the other. The U (1) charges of those two models, once diagonalized, would be the 
same. Such a symmetry in the construction method could in principle be revealed through 
an analytic study. However, it could also be the case that these models have non-equivalent 
U{1) structures, and that the GSOPs eliminated the states that were different between the 
models. Such models will not have identical superpotentials or D- and F-flat directions, 
all of which have significant impact on the phenomenological viability of a model. For the 
systematic searches in this study the U{1) charges of the matter representations will not 
be considered (although they will in future studies). Thus, two models will be considered 
identical herein if they have the same gauge and non-Abelian matter content and the same 
number of non-Abelian singlets. 
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TABLE I: Two models illustrating the inherent difficulty in comparing WCFFHS models. 



QTY 


SU{4:) 


SU{4:) 


SUiA) 


SO{W) 


^8 


1 


4 


4 


1 


1 


1 


1 


4 


1 


4 


1 


1 


1 


1 


4 


4 


1 


1 


2 


1 


6 


1 


10 


1 


2 


1 


1 


6 


10 


1 



QTY 


SU{A) 


SU{4) 


SU{4) 


50(10) 


Eg 


1 


4 


4 


1 


1 


1 


1 


4 


1 


4 


1 


1 


1 


1 


4 


4 


1 


1 


2 


6 


1 


1 


10 


1 


2 


1 


1 


6 


10 


1 



This approach has a caveat, however. As the models become increasingly complex, com- 
paring two models becomes increasingly difficult to automate. Consider the following two 
"toy" models whose particle content is presented in Tables [1} It is clear that these two 
models are equivalent if two of the SU (4) groups are switched. However, a simple boolean 
comparison of the gauge groups and matter states by a computer program will result in 
these models being counted as distinct. The root of the problem lies in the fact that the 
three SU (4) gauge groups are not identical — there are different matter representations 
that transform under them. The most obvious solution would be to perform brute force 
permutations on the identical gauge groups and resorting the matter representations. Such 
an approach would work, but takes a significant amount of computing time. Comparisons 
are the most called operation in a systematic search, however, and thus need to be as fast 
as possible in order for the search to be efficient. 

The solution implemented in this study is to propose and use a conjecture that defines 
uniqueness in a slightly stronger way. The conjecture is that identical models will have 
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TABLE II: Matter representation classes of the "toy" models. 



QTY Model 1 Classes Model 2 Classes 



SU{A),SU{A) 

1 (4,4) (4,4) 

1 (4,4) (4,4) 

1 (4,4) (4,4) 

SU{A),SO{10) 

2 (6,10) ((UO) 

2 (6,10) (6,10) 



TABLE III: Another "toy" model, declared non-existant by the conjecture about matter 

representation classes. 



QTY 


SU{A) 


SU{4) 


SU{4) 


50(10) 


Eg 


2 


4 


4 


1 


1 


1 


1 


4 


1 


4 


1 


1 


2 


1 


6 


1 


10 


1 


2 


1 


1 


6 


10 


1 



matter that fits into the same "classes" of representations. These matter representation 
classes are formed by ignoring the singlets (after verifying equal numbers of singlets in models 
under cpmparison) and considering only at the dimension of the non-Abelian representations 
and the gauge groups under which they transform. The classes of matter representations for 
the two example models are presented in Table |TT} Now it is clear the two models are likely 
equivalent. This would imply that modular invariance prevents models that have only one 
representation switched from those of a known modular invariant model. 

By this conjecture, the modular invariance of Table |T] models would imply the Table III 



model is not modular invariant. More theoretical work will need to be done to prove or 



disprove this conjecture. 

The conjecture does not fully remedy the problem of a preferred gauge group ordering. 
Generally, matter representations are limited by the charges they carry due to the mass- 
lessness constraints of the fermion states. Representations of larger dimension tend to carry 
smaller dimensional charges under the other groups in the model, if at all. Certain gauge 
groups produce representations of similar dimension, however, which still place an ordering 
on the gauge groups within a class of matter representation. In particular, the 4- and 5- 
dimensional representations of S0{5), the 4- and 6- dimensional representations of S'f/(4), 
and the 2- and 3-dimensional representations of SU{2)^'^^ can still cause double counting 
amongst models. To completely remove the dependence on ordering in WCFFHS models, 
the problem must be reduced to a counting problem. In addition to proving or disproving 
the conjecture, the following possibility could also be explored: Two models with identical 
gauge groups and ST SUSYs are identical if they have the same total number of distinct 
matter representation classes, the same number of distinct matter representation classes, 
and the same number of total fermions. 

This would remove the ordering dependence from the model comparison. Until it is 
proven, however, there is a risk of undercounting the models. Rather than risking this 
undercounting with the systematic studies presented herein, models in smaller data sets were 
examined, and duplicates were removed by hand. This will provide a systematic uncertainty 
estimate for statistics from larger data sets. 



II. THE NAHE SET 



The NAHE [T9| 162] set is a set of five order-2 basis vectors which have served as a 
common basis set for phenomenologically realistic WCFFHS models. These basis vectors 



are given in Table IV The massless particle spectrum is given in Table V The NAHE set 
particle content, in addition to the particles listed in Table [V] contains an N=l ST SUSY. 
The observable sector of the NAHE set is an SO{10) x S'f/(4)^ GUT group with three 
sixteen dimensional 5*0(10) matter representations serving as the matter generations. Each 
generation is charged under a different SU (4) gauge group, and there are two copies of each 
representation. There are two distinct representations with SO{10) charge 16 and S'f/(4) 
charge 4, since the SU (4) charge has a barred and unbarred representation. This brings 
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TABLE IV: The basis vectors and GSO coefficients of the NAHE set arranged into sets of 
matching boundary conditions. Nr is the order of the right mover. The elements ip, ip 
T] \ and (j) * are expressed in a complex basis, while x\ y\ w\ y \ and w * are expressed in 

a real basis. 



Sec 


Nr 








^.56 


V' 


1,...,5 


7] ^ 


T] 2 


r] ^ 




..,8 


1 


2 


1 


1 


1 


1 


1 




1 


1 


1 


1,... 


,1 


S 


2 


1 


1 


1 


1 





...,0 











0,... 


,0 


h 


2 


1 


1 








1 




1 








0,... 


,0 




2 


1 





1 





1 







1 





0,... 


,0 


h 


2 


1 








1 


1 










1 


0,... 


,0 



Sec 


Nr 




y 35...,6| \y 3,.. .56 


1 4i 1 — 1 4 
w ||u; ^'—'^ 


1 


2 


II 


II 


II 


s 


2 


o,..,o II 0,..,0 


o,...,o II 0,..,0 


o,..,o II 0,..,0 


bi 


2 


o,...,o II 0,...,0 


II 


o,...,o II 0,...,0 


b2 


2 


II 


o,...,o II 0,..,0 


o,...,o II 0,...,0 


h 


2 


o,...,o II 0,...,0 


o,...,o II o.,„.o 


II 





1 Sbi 


62 63^ 


1 


1 1 


1 1 


s 








h 


1 1 1 


1 1 




1 1 1 


1 1 


h 


1 1 1 





the total number of copies for each generation up to four. In addition, the dimension of 
the SU (4) charge itself is counted as being a set of copies of each generation, so the total 
number of copies of each generation is sixteen. There are no matter representations charged 
under the Eg group, so it is the designated hidden sector for this model. Extensions of the 
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TABLE V: The matter content of the model produced by the NAHE set. 



QTY 


SU{A) 


SU{4) 


SU{4) 


30(10) 


-^8 


2 


4 


1 


1 


16 


1 


2 


1 


4 


1 


16 


1 


2 


1 


1 


4 


16 


1 


2 


1 


1 


4 


16 


1 


1 


1 


1 


6 


10 


1 


2 


1 


4 


1 


16 


1 


1 


1 


6 


1 


10 


1 


1 


1 


6 


6 


1 


1 


2 


4 


1 


1 


16 


1 


1 


6 


1 


1 


10 


1 


1 


6 


1 


6 


1 


1 


1 


6 


6 


1 


1 


1 



NAHE set do not necessarily keep the designated observable sectors in the model. A model 
could, rather than break down the SO (10) gauge group, break the £^8 into an Eq, producing 
an Eq observable sector. In the past, when models were constructed individually, this was 
not common. Most individually constructed NAHE based models broke the SO{10) gauge 
group into a Pati-Salam {SU{A)® SU{2)^ SU{2)) group, SU{5)^U{1), or the MSSM gauge 
group {SU{3) ® SU{2) U{1)). In a systematic search that is no longer guaranteed, since 
the basis vectors are not organized by the researcher, but rather by the computer program. 
The notation for the next sections will change slightly; from here on the term layer will refer 
to the number of basis vectors after the initial set of five NAHE vectors rather than the 
total number of basis vectors. 
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III. STATISTICS FOR ORDER-2 LAYER-1 



The first set of statistics to be reported liere are for extensions to tfie NAHE set with 
a single basis vector of order 2. (Thus, extensions of this class can at best reduce the 
number of copies of each generation from 16 to 8.) The GSO coefficients were fixed for the 



NAHE set to those presented in Table [IV} The GSO coefficients for the extended basis vector 
were systematically generated such that all possible combinations consistent with modular 
invariance were built. This study was repeated for the NAHE set without the ST SUSY 
generating basis vector S, to determine the effect of ST SUSY on the models produced. 



A. With S 

There were 439 unique models produced out of 1,945,088 total models. Approximately 
9.5% of the models in the data set without rank-cuts were duplicates, and 13% of the models 
with rank-cuts were duplicates. All duplicates were removed prior to the statistical analysis. 
The frequency of the individual groups appearing in the unique models is presented in Table 



VI The first item of note is how many models retain at least one of the original gauge 
groups from the NAHE set. Approximately 77% of the models kept at least one S'f/(4) 
gauge group, while ~ 36% of the models kept their 5*0(10) and ~ 33% kept their E^. The 
most common gauge group in this set is SU{2), which is expected, since it is the lowest rank 
non-Abelian gauge group. About 17% have an SU{2Y^^ gauge group. In these models, this 
happens when a left moving mode is paired with a right moving mode. As mentioned earlier, 
left-right paired elements reduce the rank of the gauge lattice of the model and, hence, are 
referred to as rank cuts. The non-simply laced gauge group 5*0(5) also appears due to rank 
cuts. 

Also of interest is the number of models with a U{1) gauge group, which for this data 
set is quite high. f/(l)'s can be problematic when dealing with deeper phenomenology in a 
model. The more ?7(l)'s present in a model the more likely the model is to have anomalous 
charge. This anomalous charge must be dealt with by finding D- and F-fiat directions of 
the potential. However, the more U{1) charges present, the more fiat directions a model 
is likely to have, enabling more fiexibility when giving mass to observable sector charged 
exotics. Discussion of anomalous U{1) charges and fiat directions of these models will not 
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TABLE VI: The frequency of the individual gauge groups amongst the unique models for 
the NAHE + 02L1 data set. Gauge groups at Kac-Moody level higher than 1 are denoted 
with a superscript indicating the Kac-Moody level. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


365 


83.14% 


5C/(2)(2) 


73 


16.63% 


SU{4) 


338 


76.99% 


SU{6) 


2 


0.4556% 


SU{8) 


2 


0.4556% 


SO{5) 


155 


35.31% 


SO{8) 


141 


32.12% 


50(10) 


160 


36.45% 


50(12) 


2 


0.4556% 


50(14) 


3 


0.6834% 


50(16) 


147 


33.49% 


50(18) 


1 


0.2278% 


50(20) 


2 


0.4556% 


50(22) 


1 


0.2278% 


50 (24) 


1 


0.2278% 


50(26) 


1 


0.2278% 


Eq 


1 


0.2278% 




142 


32.35% 


Es 


144 


32.8% 


m 


332 


75.63% 



be reserved for future papers. 

Though most of the models have smaller individual gauge group components, some mod- 
els have gauge group enhancements. There are some models with SO{18), SO{20), 5*0(22), 
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FIG. 1: The number of gauge group factors for each model in the NAHE + 02L1 data set. 



S'0(24), and 5*0(26) gauge groups. Those groups have rank 9, 10, 11, 12, and 13, respec- 
tively, making them higher rank than any one of the NAHE set gauge groups. This occurs 
when an added basis vector bridges the gap between the mutually orthogonal sets of states 
of the original five basis vectors, unifying the root spaces of the individual groups into one 
larger group. For order-2 models, however, it is clear this is not common. 

Another way of measuring the enhancements that occur is looking at the number of gauge 
group factors in each model. Those are plotted in Figure [T} There is a definite peak at nine 
gauge group factors, much higher than the five initial gauge groups present in the NAHE 
set. Note that there are not many models with fewer than five gauge group factors. This 
implies that though there are several enhanced groups of higher rank than the initial NAHE 
set, the other gauge groups in the model remain broken. 

Relevant GUT groups and the number of unique models containing those groups is pre- 



sented in Table VII The relatively low number of SU{n + 1) groups (excluding SU{4)) 



explains the lack of GUT group models in this data set. 

The number of ST SUSYs are plotted against the number of unique models in Figure |2} 
Many of the basis vector extensions did not alter the ST SUSY, while about half reduced it. 
More interestingly, there is one model with enhanced ST SUSY. The basis vector for that 



model is presented in Table VIII The particle content of this model is presented in Table 
TX} The gauge groups of this model are identical to those of the NAHE set, but with fewer 
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TABLE VII: The number of unique models containing GUT groups for the NAHE + 02L1 

data set. 



GUT Group 


Number of Unique Models 


% of Unique Models 




1 


0.2278% 


50(10) 


160 


36.45% 


SU{5) ® U{1) 





0% 


SU{4) (g) SU{2) ® SU{2) 


243 


55.35% 


SU{3) SU{2) SU{2) 





0% 


SU{3)(^ SU{2)(^U{1) 





0% 




12 3 4 

Number of ST SUSYs 

FIG. 2: The number of ST SUSYs for the NAHE + 02L1 data set. 

matter representations, particularly with regard to the SU{A) charges. The enhanced ST 
SUSY comes from a new gravitino generating sector 6i + 62 + v, which contributes a single 
gravitino state to the model. The other gravitino comes from S. This example highlights the 
importance of systematic searches; the enhanced ST SUSYs come from very specific basis 
vectors that combine with the NAHE set to provide unexpected phenomenology. Though 
there are not a statistically significant number of models with this property, these models 
can highlight subtleties in the WCFFHS formulation that may go unnoticed in a random 
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TABLE VIII: A basis vector and kij matrix row which produces an enhanced ST SUSY 

when added the NAHE set. 



Sec 











^56 


- 1,...,5 


TJ ^ 


T] ^ 




- 1,...,8 


V 


2 


1 


1 


1 


1 


0,...,0 


1 


1 





0,...,0 



Sec 





"'^'^ly ^'^11?/ ^'^'W^ ^'^ 


^ S^.-.^Sjl^ 3,.. .,6 




V 


2 


o,o,i,i||i,i,i,i 


o,o,i,i||i,i,i,i 


0,.,0||0,...,0 



k^j = (0, 0, 0, 1, 1) 



TABLE IX: The particle content of the TV = 2 ST SUSY NAHE based model. 



QTY 


SU{4:) 


SU{A) 


SU{A) 


50(10) 


Es 


1 


1 


4 


1 


T6 


1 


1 


1 


4 


1 


16 


1 


1 


1 


1 


4 


T6 


1 


1 


1 


1 


4 


16 


1 


1 


1 


1 


4 


T6 


1 


1 


1 


1 


4 


16 


1 


1 


1 


4 


1 


16 


1 


1 


1 


4 


1 


16 


1 


2 


1 


6 


6 


1 


1 


2 


6 


1 


1 


10 


1 



search. 

The number of U{1) gauge groups are plotted against the number of unique models in 
Figure |3} The greater number of U{1) factors present in the model, the greater that model's 
capacity for carrying anomalous charge. There are relatively few U (l)'s in the models of this 
class. This is likely the result of the basis vectors having only periodic phases — nonzero, 
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12 3 4 

Number of U{1) Factors 

FIG. 3: The number of U{1) factors for the NAHE + 02L1 data set. 

non-periodic pliases break S0{2n) groups into SU{n — 1) f/ (1) groups in most cases. Thus, 
there are not many U{1) groups expected or found in this data set. 

Another phenomenological property which might have statistical significance is the num- 
ber of non-Abehan singlets, as non-Abelian singlets often carry observable sector hyper- 
charge. However, no such particle with this property has yet been observed. Additionally, 
singlet particles contribute to the mass-energy density of the model. Too many non-Abelian 
singlets could result in a mass-energy density higher than observed values, also producing 
bad phenomenology. The number of non-Abelian singlets for this data set is plotted in 
Figure |4} 

B. Without S 

Also of interest is the effect of the S vector in the set of NAHE basis vectors. Not only 
does the S vector generate ST SUSY, it also adds a degree of freedom to the kij matrix. 
It stands to reason that removing S will also have an effect on the massless gauge and 
matter content in addition to the supersymmetry. There are 282 unique models in the 
set of 1,940,352 consistent models, in contrast to the 439 models in the data set with S. 
About 8.4% of the models without rank cuts had duplicates, while 9.4% of the models with 
rank cuts were duplicates of other models produced. All duplicates were removed from the 
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FIG. 4: The number of non-Abelian singlets in the NAHE + 02L1 data set. 



statistics to follow. Moreover the 282 models in this set do not all belong to the = 
models in Figure |2} In fact, the two data sets have only 8 models in common, implying that 
the presence or non-presence of S has an effect on more than the ST SUSY. 

The gauge content of the order-2 layer-1 NAHE extensions without the S vector are 
presented in Table |X| A side-by-side comparison of the gauge group content percentages is 



shown in Table XL The similarities between the gauge content of these two data sets are 
striking. It is tempting to assume that ignoring ST SUSYs when determining uniqueness 
will result in the data sets being identical, since the ST SUSY generator S is the only real 
difference between the data sets. This is not the case. In fact, ignoring ST SUSYs when 
comparing the intersection of these two sets of models gives the same number of models 
common to both sets: 8. This implies that the matter representations are affected by 
whether 5* is in the set of basis vectors making up a model. What is likely occurring in the 
models with 5* is that the sector coming from S is contributing non-adjoint representations 
to the matter states of the model. In other words, gravitinos are not the only fermion 
states coming from S. In order to begin fully mapping the heterotic landscape the full effect 
of leaving out the S vector should be examined, as its presence may affect the likelihood 
of models with three chiral generations. Other statistics from this data set will now be 
presented for completeness. 



The number of gauge group factors, ST SUSYs, and U{1) factors are plotted in Figure |5 
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TABLE X: The gauge content of the NAHE + 02L1 data set without S. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


233 


82.62% 


S'C/(2)(2) 


67 


23.76% 


S'C/(4) 


212 


75.18% 


SU{6) 


2 


0.7092% 


SU{8) 


2 


0.7092% 


SO{5) 


107 


37.94% 


SO{8) 


89 


31.56% 


50(10) 


100 


35.46% 


50(12) 


2 


0.7092% 


50(14) 


3 


1.064% 


50(16) 


83 


29.43% 


50(18) 


1 


0.3546% 


50(20) 


2 


0.7092% 


50(22) 


1 


0.3546% 


50(24) 


1 


0.3546% 


50(26) 


1 


0.3546% 




1 


0.3546% 


Ej 


95 


33.69% 


Es 


98 


34.75% 


[/(I) 


209 


74.11% 



None of the models had any non-Abelian singlets, suggesting that the non-Abelian singlets 
may be coming from the S sector. The occurrances of GUT groups are charted in Table 

m 
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TABLE XI: A side-by-side comparison of the gauge content for NAHE -|- 02L1 with and 

without S. 



Gauge Group 


With S 


Without S 


SU{2) 


83.14% 


82.62% 


5?7(2)(2) 


16.63% 


23.76% 


SU(A) 


76.99% 


75.18% 


SU(6) 


0.4556% 


0.7092% 


SU{8) 


0.4556% 


0.7092% 


S0{5) 


35.31% 


37.94% 


SO(8) 


32.12% 


31.56% 


SO(10) 


36.45% 


35.46% 


50(12) 


0.4556% 


0.7092% 


SO(U) 


0.6834% 


1.064% 


SO{16) 


33.49% 


29.43% 


sons) 


0.2278% 


0.3546% 


SO(20) 


0.4556% 


0.7092% 


50(22) 


0.2278% 


0.3546% 


30(24) 


0.2278% 


0.3546% 


SO{26) 


0.2278% 


0.3546% 


Eq 


0.2278% 


0.3546% 


E7 


32.35% 


33.69% 




32.8% 


34.75% 


m 


75.63% 


74.11% 



IV. STATISTICS FOR ORDER-3 LAYER-1 

The next set of statistics to be reported are for extensions to the NAHE set with a single 
basis vector of right moving order-3. The order-3 basis vectors added are fermion sectors; 
the left movers are order- 2. Since the orders of the left and right movers are not the same, 
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FIG. 5: Statistics for the NAHE + 02L1 data set without S. 



the total order of the basis vector extensions is 6. The difference between order-3 basis 
vectors of this type and true order-6 basis vectors is that all six possibilities for the phases 
will appear in an order-6 right mover, while only three phases appear in an order-3 right 
mover. The coefficients generating the sectors from these basis vectors still range from to 
5, however. This has interesting effects on the fermion spectrum, which have been noted in 
|63j . Statistics will be presented for models of this type both with and without S. 



With S 



The presence of S in the NAHE set for this search causes any order-3 basis vector with 
the same left mover as S to be inconsistent. This is due to the | |Zf symmetries of the left 
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TABLE XII: The number of unique models containing GUT groups for the NAHE + 02L1 

data set without S. 



GUT Group 


Number of Unique Models 


% of Unique Models 




1 


0.3546% 


50(10) 


100 


35.46% 


SU{5) U(l) 





0% 


SU{A) SU{2) SU{2) 


156 


55.32% 


SU{3) SU{2) SU{2) 





0% 


SU{?,)®SU{2)®U{1) 





0% 



and right mover. Adding S to three times the basis vector extension results in a second 
sector, which means the set of basis vectors is not linearly independent. Moreover, since the 
right mover does not have any periodic phases, there can be no rank-cutting in these models. 
Non-simply laced gauge groups and higher level Kac- Moody algebras are not present. 

Despite having no linearly independent basis vectors with the same left movers as 5, this 
data set contains quite a bit more distinct models. There were 373,152 models in this set, but 
only 3,036 were unique. This relatively (compared to the order-2 extension) high number of 
unique models suggests that the periodic /anti-periodic phases of the order-2 models have a 
redundancy not present in models of this type. Based on the double counting of the order-2 
models without rank-cuts, the estimated systematic uncertainty for the order-3 statistics is 



10%. The gauge content of these models is presented in Table XIII The most noticeable 



difference between Table XIII and Table VI is the number of SU (n + l)-type gauge groups. 
The GSO projections of the new sector break the untwisted (0) sector from 5*0(44) to 
smaller 5'0(2n)-type groups. Phases that are neither periodic nor anti-periodic transform 
these groups from S0{2n) to SU{n) ® U{1). Since all of the phases in this set fall into 
that category, the S'0(2n)-type groups appear when the states from the added sector are 
projected out by certain kij matrix choices. The SU{n + l)-type groups are created when 
the contributions from the added sector are left in the model. The number of gauge group 
factors per model are plotted in Figure [6] As with Figure [T| there are still very few models 
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TABLE XIII: The gauge group content of the NAHE + 03L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


2587 


85.21% 


SU{3) 


923 


30.4% 


SU(4) 


2241 


73.81% 


SU{5) 


543 


17.89% 


SU{6) 


735 


24.21% 


SU{7) 


215 


7.082% 


SU{8) 


460 


15.15% 


SU{9) 


76 


2.503% 


SU{10) 


76 


2.503% 


su{n) 


17 


0.5599% 


SU{12) 


41 


1.35% 


SU{13) 


3 


0.09881% 


SU{U) 


5 


0.1647% 


50(8) 


860 


28.33% 


50(10) 


659 


21.71% 


50(12) 


400 


13.18% 


50(14) 


372 


12.25% 


50(16) 


260 


8.564% 


50(18) 


11 


0.3623% 


50(20) 


33 


1.087% 


50(22) 


5 


0.1647% 


50(24) 


15 


0.4941% 


50(26) 


3 


0.09881% 


Eg 


193 


6.357% 




147 


4.842% 


Es 


80 


2.635% 


U{1) 


2955 


97.33% 
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FIG. 6: The number of gauge group factors per model in the NAHE + 03L1 data set. 



TABLE XIV: The number of unique models containing GUT groups for the NAHE + 

03L1 data set. 



GUT Group 


Number of Unique Models 


% of Unique Models 




193 


6.36% 


50(10) 


659 


21.71% 


SU{b) ® U{1) 


543 


17.89% 


SC/(4) SU{2) SU{2) 


1648 


54.28% 


SU{2,) SU{2) (g) SU{2) 


628 


20.69% 


SU{3)(gSU{2)(gU{l) 


775 


25.53% 



which have less than five gauge group factors. There are also peaks at 10, 11 and 12 gauge 
group factors, as opposed to just a single peak at 9 in Figure [T] 

Relevant GUT groups and number of unique models containing those groups are presented 
in Table XIV The number of ST SUSYs for the order-3 data set is plotted in Figure [7} 



Notice that a statistically significant number of models have enhanced ST SUSY. This trend 
is expected, as every odd-ordered right mover with a massless fermion left mover will produce 
an additional gravitino generating sector in the model. 
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FIG. 7: The ST SUSYs for the NAHE + 03L1 data set. 
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FIG. 8: The number of U{1) factors for the NAHE + 03L1 data set. 



Table XIII also makes apparent the number of models containing U{1) gauge groups. 
The number of f/(l)'s per model are plotted in Figure [s] It is clear that most models have 
multiple U{1) factors, and that there are more U{1) factors per model for this data set 
than the 02L1 data set. The number of non-Abelian singlets are plotted in Figure [9| As 
with the order-2 extensions, the distribution of non-Abelian singlets drops off sharply after 
0, indicating the NAHE-base single-layer models do not have a tendency to produce many 
non-Abelian singlets. 
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FIG. 9: The number of non-Abelian singlets for the NAHE + 03L1 data set. 



B. Without S 



Removing S from the NAHE set for the order-2 layer- 1 extensions had interesting con- 
sequences on the available matter sectors. For the order-3 layer-1 extensions the effect is 
expected to be more drastic, as linear independence prevented any models with the same 
left mover as S to exist in a model. There should be less of an impact on the ST SUSY, 
however, since order-3 basis vectors with massless left movers always produce their own 
gravitino generating sector. The lower number of possibilities for kij values will also have 
an effect on the number of models in the set. 

There are 447 unique models in this set out of 870,688 consistent models, and of those 
146 models also belong to the data set with S. Based on the number of duplicates in the 
02L1 data set (without 5*), the estimated systematic uncertainty for these statistics is 10%. 
There is significantly more overlap between the two sets, yet there are also significantly fewer 
unique models. The gauge group content of the NAHE+03L1 without S is presented in 
Table 



XV 



A brief comparison between Tables XV and XIII makes it clear the presence of 5* 



did not significantly affect the gauge groups, as was the case with the order-2 extensions. For 
completeness the occurances of the GUT groups and other relevant statistics for this data 
set will be presented. The occurrances of the GUT groups in this data set are tabulated in 



Table XVI The number of gauge group factors, U{1) factors, ST SUSYs, and non-Abelian 
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TABLE XV: The gauge group content of the NAHE + 03L1 data set without S. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU(2) 


368 


82.33% 


SU{3) 


128 


28.64% 


SU(4) 


313 


70.02% 


SU{5) 


70 


15.66% 


SU{6) 


96 


21.48% 


SU{7) 


26 


5.817% 


SU{8) 


71 


15.88% 


SU{9) 


15 


3.356% 


SU{10) 


14 


3.132% 


su{n) 


3 


0.6711% 


SU{12) 


8 


1.79% 


SU{13) 


1 


0.2237% 


SU{U) 


1 


0.2237% 


50(8) 


124 


27.74% 


50(10) 


97 


21.7% 


50(12) 


53 


11.86% 


50(14) 


49 


10.96% 


50(16) 


42 


9.396% 


50(18) 


3 


0.6711% 


50(20) 


6 


1.342% 


50(22) 


1 


0.2237% 


50(24) 


2 


0.4474% 


50(26) 


1 


0.2237% 


Eg 


37 


8.277% 




32 


7.159% 


Es 


16 


3.579% 


U{1) 


430 


96.2% 
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singlets are plotted in Figure [TO 



TABLE XVI: The occurrances of the GUT groups for the NAHE + 03L1 data set without 

S. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


Eq 


37 


8.277% 


50(10) 


97 


21.7% 


SU{5) ^ U{1) 


70 


15.66% 


SUiA) SU{2) (g) SU{2) 


220 


49.21% 


SU{2,) ® SU{2) SU{2) 


81 


18.12% 


SU{3)(gSU{2)(gU{l) 


100 


22.37% 



V. MODELS WITH GUT GROUPS 

The next several sections will outline statistics on models containing GUT groups from 
the NAHE + 02L1 and NAHE + 03L1 data sets (with S). The GUT groups to be examined 
are Eg, SO{10), SU{5)®U{1), SU{A)®SU{2)®SU{2) (Pati-Salam), SU{3)®SU{2)®SU{2) 
(Left- Right Symmetric), and SU{3) O SU{2) ® f/(l) (MSSM). In addition to the spread of 



statistics presented in sections III and IV, the number of chiral fermion generations will be 
counted, along with any exotics which carry observable sector charge. These statistics will 
be gathered for all possible observable sector configurations. Thus, if there is more than one 
copy of any GUT group in a model (as is often the case), all possible choices for each group 
forming the observable sector will be examined. 

The definition of a chiral matter generation will be presented with each group for clarity, 
and only the "net" chiral generations will be counted. That is, if there are an equal number 
of barred and unbarred generations, the model will have no net chiral generations. Any 
"net" generations which cannot be paired must remain massless until the Higgs boson gains 
a VEV at the TeV scale. 

The statistics gathered on the chiral generations here are not enough to qualify a model 
or set of models as being "realistic." This study looks only to examine the basic components 
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FIG. 10: Statistics for the NAHE + 03L1 data set without S. 



that sometimes lead to realistic models. Actually determining whether or not a model 
is realistic requires detailed analysis of the charges, finding the superpotential, and 

finding the D- and F-fiat directions. Progress is being made to automate the above steps 
and integrate the deeper phenomenological components of WCFFHS model building into the 
FF Framework. For the present analysis, however, discussion of these aspects of WCFFHS 
phenomenology is omitted. 
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A. Eq Models 



Each SM generation of fermions fits into a 27 dimensional representation of Eq, so tlie 
number of net chiral matter generations is given by 

\N27-N^\. (10) 

Additionally, for large GUT groups, states which transform under the "hidden sector" can be 
treated as being multiple copies of an observable generation. The hidden sector groups can 
be broken somewhat easily by adding basis vectors, so for certain GUT groups the dimension 
of the hidden sector charge is treated as the number of duplicate observable generations. 

Statistics will now be presented for Eq models coming from single layer extensions to the 
NAHE set. There was only one model with an Eq group in the NAHE + 02L1 data set, 
but there were 193 models with Eq in the NAHE + 03L1 data set. The number of net 



chiral fermion generations with and without hidden sector duplicates is plotted in Figure 11 



along with the number of observable sector charged exotics. The hidden sector gauge group 



content of these models is presented in Table XVII Of note in this table is the presence 
of SU{5) and SU{3) hidden sector gauge groups in large quantities. This result stands in 
contrast to those found in the order-2 search performed in [58j, and is likely due to the 
extension being an order-3 basis vector. Similar conclusions were also found in |6l] . Order-3 
basis vector extensions have very different properties from order-2 and order-4 basis vectors. 
Figures [TT] shows that the distribution of net fermion generations tends to be at zero; either 
no 27's are produced, or every 27 is accompanied by a 27 for those models. For models with 
more than zero chiral fermion generations, the number of generations per model is even 



both with and without hidden sector duplicates. It is also apparent from Figure [TT] that 
the Eq charged fermions do not couple to the hidden sector in most of these models. These 
examinations make it clear that, though most models do not contain exotic states, there are 
never the correct number of chiral fermion generations to produce a realistic model. 

For completeness the rest of the statistics for this data set will be presented. The number 
of gauge group factors per model, the number of U{1) factors, the number of ST SUSYs, 



and the number of non-Abelian singlets are presented in Figure [12 
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FIG. 11: Statistics related to the chiral matter generations for Eq models in the NAHE + 

03L1 data set. 



B. 50(10) Models 

In the models containing 50(10), the chiral fermion generations are defined to be in the 
16 dimensional representations. Thus, the number of net chiral fermion generations is given 
by 

l^^ie-iVlel- (11) 

The NAHE + 02L1 data set has 160 models with SO(IO), while the NAHE + 03L1 data 
set has 659 models. The number of chiral fermion generations with hidden sector charges 



is plotted in Figure 13 for the NAHE + 02L1 and NAHE + 03L1 data sets. The number 



of chiral fermion generations without hidden sector charges is also plotted in Figure [13] for 
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TABLE XVII: The hidden sector gauge group content of models containing Eq within the 

NAHE + 03L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


165 


85.49% 


SU{3) 


54 


27.98% 


SU{A) 


113 


58.55% 


SU{5) 


39 


20.21% 


SU{6) 


43 


22.28% 


SU{7) 


5 


2.591% 


SU{8) 


10 


5.181% 


SO{8) 


35 


18.13% 


50(10) 


41 


21.24% 


50(12) 


12 


6.218% 


50(16) 


19 


9.845% 


Es 


11 


5.699% 


U{1) 


193 


100% 



the order-3 5*0(10) models. There were no order-2 50(10) models with more than zero 
chiral fermion generations. The hidden sector gauge groups for the NAHE + 02L1 50(10) 



models are presented in Table XVIII The NAHE + 03L1 50(10) model hidden sector gauge 



groups are presented in Table |XIX[ The number of observable exotic states for these models 

As with the Eq data sets, 



is plotted in Figure [14] for the order-2 and order-3 models, 
there are no models with three chiral generations. The number of generations in any of the 
models presented is either zero or even. The distribution of charged exotics is more spread 
out than it was for the Eq models. This is an artifact of the NAHE set gauge group; the 
5*0(10) states in that set are charged under the three 5'[/(4) gauge groups. Most observable 
50(10) states tend to keep some of those charges under the hidden sector. The remaining 



statistics for these models are presented in Figure 15 for the order-2 models. The remaining 



statistics for the order-3 models with 50(10) are presented in Figure 16 



31 




1 2 3 4 5 10 15 

Number of ST SUSYs Number of NA Singlets 



(c) (d) 
FIG. 12: Statistics for the models containing Eq in the NAHE + 03L1 data set. 

C. SU{5) ® U{1) Models 

The (flipped) SU (5) GUT group's matter generations are spht into multiple representa- 
tions of the SU{5) group. An anti-lcpton doublet and the up-type quarks arc placed in a 5 
representation, while the right-handed neutrino, the anti-quark doublet, and the down-type 
quarks appear in a 10 dimensional representation of SU{5). Thus, a generation is formed 
by pairing the 10-reps with the 5-reps. The net number of generations for an SU{5) (g) C/(l) 
model is given by 

|min(iVio, iVg) - min(Ar^, N,)\. (12) 

There were no order-2 NAHE based models containing SU{5) (8) U{1), but there were 543 
order-3 models with this GUT group. The hidden sector gauge groups of those models are 
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FIG. 13: Statistics for the chiral matter generations of the 50(10) models in the NAHE + 

02L1 and NAHE + 03L1 data sets. 



presented in Table XX The number of net chiral generations with and without hidden sector 



duplicates are plotted in Figure 17, along with the number of exotic states with observable 
sector charges. The most striking feature of this data set is that there are models with three 
chiral generations both with and without hidden sector duplicates. The significance of this 
finding is that these models do not have rank cuts, and thus carry a geometric interpretation. 
This implies that these models may be written in another construction method, particularly 
that of orbifolding [llj. These are the first three-generation models of their kind. More 
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TABLE XVIII: Hidden sector gauge groups for ,50(10) models in the NAHE + 02L1 data 

set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


121 


75.62% 


5C/(2)(2) 


25 


15.62% 


SU(4) 


111 


69.38% 


SO(5) 


59 


36.88% 


SO{8) 


1 


0.625% 


50(14) 


1 


0.625% 


SO(16) 


56 


35% 


50(20) 


1 


0.625% 


SO{22) 


1 


0.625% 


E7 


51 


31.87% 


Es 


51 


31.87% 


('{1) 


102 


63.75%: 
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FIG. 14: The number of observable sector charged exotics for S0{1Q) models in the NAHE 

+ 02L1 and NAHE + 03L1 data sets. 
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TABLE XIX: Hidden sector gauge groups for 50(10) models in the NAHE + 03L1 data 

set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


570 


86.49% 


SU{3) 


126 


19.12% 


SU{A) 


456 


69.2% 


SU{5) 


74 


11.23% 


SU{6) 


132 


20.03% 


SU{7) 


20 


3.035% 


SU{8) 


64 


9.712% 


SU{9) 


5 


0.7587% 


SU{10) 


15 


2.276% 


SU(12) 


12 


1.821% 


SO(8) 


105 


15.93% 


50(12) 


75 


11.38% 


50(14) 


34 


5.159% 


50(16) 


69 


10.47% 


50(20) 


13 


1.973% 


50(22) 


5 


0.7587% 


Eq 


41 


6.222% 


El 


29 


4.401% 


Es 


20 


3.035% 


U{1) 


604 


91.65% 



analysis will be done in Sec. VI regarding this new class of three-generation models. Other 
statistics for this data set are plotted in Figure [18 
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FIG. 15: Statistics for the SO{10) models in the NAHE + 02L1 data set. 



D. Pati-Salam Models 

Pati-Salam models consist of models with a gauge group SO{6)<Si SO{4:), which is isomor- 
phic to the gauge group SU (4) SU (2) (g) SU (2). The latter is the form of this gauge group 
which appears in WCFFHS models. The quark and lepton generations are in representations 
of (4,2,1), while the anti-quarks and anti-leptons are in representations of (4,1,2). Because 
these data sets are formed by examining all permutations of possible observable sectors, the 
same statistics will emerge when the chiral generations and anti-generations are examined 
separately. As there are no three-generation models in this data set, this case will not be 
considered. The equation for the number of net chiral generations is 

|A^(4,2,1)-%,2,1)|- (13) 
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FIG. 16: Statistics for the SO{10) models in the NAHE + 03L1 data set. 



There are 243 unique models containing this gauge group in the order-2 NAHE extensions, 
and there are 1,648 unique models with this gauge group in the order-3 NAHE extensions. 
The abundance of these models is expected; as they contain the most common non-Abelian 
gauge group, SU{2), along with a group that the NAHE set model already has. 

No order-2 models with the Pati-Salam gauge group have net chiral matter generations, 
but other statistics related to those models are presented, as are order-3 model statistics. 



For the order-2 models, the hidden sector gauge group content is given in Table XXI In this 
data set, both SU{2) at KM-level 1 and Sf/(2)(2) at KM-level 2 are included as possibilities 
for the observable Pati-Salam gauge group. The number of charged exotics is presented 



in Figure 19 The hidden sector gauge group content for the order-3 models is presented 



in Table XXII The number of chiral generations and observable sector charged exotics for 
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FIG. 17: Statistics related to observable matter for the SU{5) (g) U{1) models in the NAHE 

+ 03L1 data set. 



these models are plotted in Figure 20 



Note that the number of chiral generations is zero in most cases for the NAHE + 03L1 
data set as well. This implies that the symmetry breaking to the two SU (2) gauge groups 
splits the three distinct generations of the NAHE observable sector evenly. Thus, three gen- 
eration Pati-Salam models likely require more complicated basis vector sets. The remaning 



statistics are presented in Figure [21) Statistics for the NAHE + 03L1 data set are presented 
in Figure |22 
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TABLE XX: The hidden sector gauge groups of the SU{5) O U{1) models in the NAME + 

03L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{2) 


449 


82.69% 


SU(3) 


468 


86.19% 


SU(4) 


284 


52.3% 


SU(6) 


52 


9.576% 


SU{7) 


50 


9.208% 


SU{8) 


52 


9.576% 


SU(9) 


22 


4.052% 


SU{10) 


5 


0.9208% 


50(8) 


57 


10.5% 


S'O(IO) 


74 


13.63% 


50(12) 


35 


6.446% 


SO{U) 


72 


13.26% 




39 


7.182% 




10 


1.842% 



E. Left-Right Symmetric Models 

The final GUT considered in this study is a derivative of the Pati-Salam GUT group 
referred to as the Left-Right Symmetric group. It retails the du&l-SU (2) nature of the Pati- 
Salam GUT, but the SU (4) gauge group is broken into an SU (3) group directly representing 
the strong force. The generations of quarks fit into a (3,2,l)-dimensional representation while 
the generations of anti-quarks fit into a (3,l,2)-dimensional representation. The lepton and 
anti-lepton generations are placed in a (1,2,1) and (1,1,2) representation, respectively. As 
the quark generations are usually more constraining in WCFFHS models, the term chiral 
matter generation refers only to the quarks, while the term chiral anti-generation refers only 
to the anti-quarks. Lepton generations will need to be taken into account when considering 
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FIG. 18: Statistics for the SU{5) ® [/(I) models in the NAHE + 03L1 data set. 



a quasi-reahstic model, but here statistics will be gathered only with respect to the quark 
generations for simplicity. 

The number of net chiral (anti)generations is given by 

|iV(3,2,l)-%,2,l)|- (14) 

Since the statistics loop over all possible observable sector configurations, the statistical data 
on the net number of chiral generations and anti-generations are identical. There are no 
models with the gauge group in the NAHE + 02L1 data set, as that data set contains no 
models with SU{3) gauge groups. There are 628 distinct models in the NAHE + 03L1 data 
set with this gauge group. The hidden sector gauge content of those models is presented in 



Table |XXIII[ The number of net chiral generations is presented in Figure |23] along with the 
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TABLE XXI: The hidden sector gauge group content in Pati-Salam models from the 

NAHE + 02L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SO{5) 


60 


24.69% 


SO{8) 


51 


20.99% 


50(10) 


61 


25.1% 


50(12) 


1 


0.4115% 


50(16) 


80 


32.92% 


50(20) 


2 


0.823% 


Ej 


86 


35.39% 


Es 


75 


30.86% 


U{1) 


185 


76.13% 



number of observable sector charged exotics. Like the SU{5) ® U{1) data set, there are 
three-generation models present here. There are 70 models with three net chiral generations. 
One such model will be presented as an example at the end of the paper. The remaining 



statistical information on these models is presented in Figure 24 



F. MSSM-like Models 

The MSSM0 gauge group is SU{3) O SU{2) O U{1). A generation of quarks fit in a (3,2) 
representation of these groups. The leptons fit in a (1,2) representation. The generations of 
antimatter are charged differently, as the antiparticles do not have isospin. A generation of 
antimatter consists of two (3,1) representations, one for the "up" -type quarks and one for 
the "down" -type quarks. While the leptons fit into a (1,2) representation, the anti-leptons 
are (1,1) singlets. As was the case with Left-Right Symmetric models, the terms chiral 
generation and anti-generation refer only to the quarks. While the lepton generations must 
also be considered, statistics are only gathered for the quark generations, as they are more 
Here MSSM refers only to the gauge group content. Models with this gauge group may or may not have ST 
SUSY. 
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FIG. 19: The number of observable sector charged exotics from Pati-Salam models in the 

NAHE + 02L1 data set. 
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FIG. 20: Statistics related to observable matter in the Pati-Salam models from the NAHE 

+ 03L1 data set. 



constraining. 

The equation for the number of net chiral matter generations is 

|-/V(3,2) - %,2)h 

while the number of net chiral antimatter generations is 

\N(^3,l) - %,i)l- 
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TABLE XXII: The hidden sector gauge group content of the Pati-Salam models in the 

NAHE + 03L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{3) 


344 


20.87% 


SU{5) 


174 


10.56% 


SU{6) 


414 


25.12% 


SU{7) 


116 


7.039% 


SU{8) 


214 


12.99% 


SU{9) 


30 


1.82% 


SU{W) 


31 


1.881% 


su{n) 


5 


0.3034% 


SU{U) 


24 


1.456% 


SO(8) 


422 


25.61% 


50(10) 


332 


20.15% 


50(12) 


255 


15.47% 


50(14) 


163 


9.891% 


50(16) 


106 


6.432% 


50(20) 


33 


2.002% 


Eq 


81 


4.915% 


El 


56 


3.398% 


Es 


15 


0.9102% 


U{1) 


1615 


98% 



There are no models with this gauge group from the NAHE + 02L1 data set since there 
are no SU{3) gauge groups. There are, however, 775 models in the NAHE + 03L1 data 
set with the MSSM group. The hidden sector gauge group content of models containing the 



MSSM gauge group is presented in Table XXIV The number of net chiral generations and 



anti-generations are presented in Figure 25 The number of observable sector charged exotics 
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FIG. 21: Statistics for the Pati-Salam models in the NAHE + 02L1 data set. 



is also plotted in Figure [25j There are models with three chiral matter generations, as well 
as models with three anti-generations. However, none of the models have three generations 
of quarks and anti-quarks. While these findings are still significant due to their novelty, 
they do not point towards phenomenologically realistic models as the SU{5) ® U{1) and 
Left-Right Symmetric models do. The remaining statistics for these models are presented 
in Figure |26j 



G. ST SUSYs 



There is a trend regarding the number of ST SUSYs in GUT models — that distributions 
of ST SUSYs for the most part do not change. Figure [27] contains the ST SUSY distributions 
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FIG. 22: Statistics for the Pati-Salam models in the NAHE + 03L1 data set. 



for the full data set, the 5*0(10) models, and the Pati-Salam models. It is clear that the 
ST SUSY distributions are relatively even for each of the sample sets of models. The 
same can be said of order-3 models, whose ST SUSY distributions are presented in Figures 



28] and [29j Only the Eq models display any sort of statistical coupling to the number of 
ST SUSYs, having significantly more N=2 models than any other data set. The other 
samples, however, have nearly identical distributions, suggesting that the number of ST 
SUSYs is not statistically linked to the GUT group content. Further investigations of these 
findings show several statistical couplings for higher ST SUSY models containing certain 
gauge group factors. The statistical test to be used invokes the Central Limit Theorem, 
which is applicable to populations which are well behaved, such as the number ST SUSY 
distributions discussed. We will average the number of ST SUSYs per model for models 
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TABLE XXIII: The hidden sector gauge group content of the Left- Right Symmetric 

models in the NAHE + 03L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{A) 


344 


54.78% 


SU(5) 


264 


42.04% 


SU(6) 


112 


17.83% 


SU(7) 


149 


23.73% 


SU{8) 


84 


13.38% 


SU{9) 


29 


4.618% 


5f/(10) 


10 


1.592% 


su{n) 


17 


2.707% 


50(8) 


89 


14.17% 


SO{W) 


95 


15.13% 


SO{U) 


53 


8.439% 
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FIG. 23: Observable matter statistics for the Left- Right Symmetric models in the NAHE 

-I- 03L1 data set. 
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FIG. 24: Statistics for the Left-Right Symmetric models in the NAHE + 03L1 data set. 



containing each of the gauge groups present. Random sample averages will be close to the 
average of the population; therefore any sample drawn based on gauge groups which has 
an average close to the population average indicates that the average number of ST SUSYs 
is not coupled to the gauge group. If the average number of ST SUSYs per model for a 
particular gauge group (for example, Eq) is higher than the population (and if the sample is 
large enough to be significant), then we can conclude that the gauge group content has an 
effect on the number of ST SUSYs for a significant percentage of models. These significances 



are plotted in [Figure 30| for the NAHE + 02L1 data set and [Figure 31| for the NAHE + 
03L1 data set. 

While there are no significant gauge groups in the NAHE + 02L1 data set, all three 
exceptional groups, as well as SU{9) and SU {12), have significant effects on the average 
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TABLE XXIV: The hidden sector gauge group content of the MSSM models in the NAHE 

+ 03L1 data set. 



Gauge Group 


Number of Unique Models 


% of Unique Models 


SU{A) 


412 


53.16% 


SU(5) 


374 


48.26% 


SU(6) 


112 


14.45% 


SU(7) 


169 


21.81% 


SU{8) 


112 


14.45% 


SU{9) 


41 


5.29% 


5f/(10) 


10 


1.29% 


su{n) 


17 


2.194% 


50(8) 


97 


12.52% 


S'O(IO) 


111 


14.32% 


50(12) 


35 


4.516% 


SO{U) 


68 


8.774% 


Eq 


46 


5.935% 



number of ST SUSYs. This is hkely due to the most common embeddings of the SUSY 
sectors causing additional roots in the gauge groups, promoting them from SU {n + 1) to 
S0{2n) or En. Additional analysis will be needed to confirm the cause of this significance. 
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FIG. 25: Observable matter related statistics for the MSSM models in the NAHE + 03L1 

data set. 
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FIG. 27: The distributions of ST SUSYs for the NAHE + 02L1 GUT group data sets. 
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FIG. 28: Some of the distributions of ST SUSYs for the NAHE + 03L1 GUT group data 

sets. 



52 



,1,500 



1.000 

s 

^ 500 



•Z, 



1.445 




286 

I 



1 2 .'J 

Number of ST SUSYs 

(a) Full data set 



200 



100 



262 



243 



38 



1 2 3 

Number of ST SUSYs 



CD 

o 

§ 600 



a 

m 

a 



400 



I 200 

s 

3 
"Z 



794 



730 



124 



1 2 3 

Number of ST SUSYs 



(b) SU{5) (g) i7(l) 



(c) Pati-Salam models 



m 300 
"a; 
•a 

o 

s 

I 200 



Q 

o 



3 



100 



304 



278 



46 



12 3 
Number of ST SUSYs 



400 



>| 300 



Q 

o 



3 



200 



100 



376 



345 



54 



12 3 
Number of ST SUSYs 



(d) Left-Right Symmetric models 



(e) MSSM models 



FIG. 29: The remaining distributions of ST SUSYs for the NAHE + 03L1 GUT group 

data sets. 
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FIG. 31: The significance values for models in the NAHE + 03L1 data set. Any 
significance values greater than three indicate a strong statistical significance. 



55 



VI. THREE GENERATION MODELS WITH A GEOMETRIC INTERPRETA- 
TION 



Several models containing three net chiral matter generations with SU{5)^U{1) and Left- 
Right Symmetric GUT groups were found in the NAHE + 03L1 data set. As previously 
mentioned, this finding is novel because these models do not have rank-cuts, and thus have 
a geometric interpretation. The usual statistics will be reported for these models, and a 
potentially realistic model from each gauge group will be presented as an example. To 
determine the viability of these models, more phenomenology must be done. In particular, 
finding the U{1) charges and the superpotential would be the first step, then the D- and 
F-flat directions can be found. If the flat directions can eliminate the observable sector 
charged exotic matter simultaneously with the anomalous U{1) charge, then the model 
could be considered a quasi-realistic model. 

A. A Three Generation SU{5) <S> U{1) Model 



Presented in this section is an explicit example of a NAHE based three-generation SU{5)(, 
f/(l) model with = 1 ST SUSY. The gauge group for this model is SU{3)^ SU{A) ( 



SU{5) ® SU{6) ® U{iy. Table XXV shows the basis vectors, and Table XXVI shows the 



TABLE XXV: A basis vector and kij matrix row which produces a three-generation 

SU{5) (g) U{1) model 



Sec 


Nr 


i; 




2,34 


^,56 


- 1,...,5 


r] ^ 


7] ^ 


7] ^ 


- 1,...,8 


V 


3 


1 


1 








0- - 


2 
3 





2 
3 


0- - 

u,u, 3 , . . . , 3 



Sec 





y ^'"^w ^'^\\y ^'"^w ^'^ 


y 3,...56 j j ^ 3,.. .,6 


W \\w -^'■■■'^ 


V 


3 


0,0, 1,1||0, 0,0,0 


nnnn||2 2 2 2 

1 3 , 3 , 3 , 3 


nnii||222 2 

U,U,i,X| 13,3,3, 3 



kffj = (0, 1, 1, 0, 1) 



particle content. The observable sector matter is tabulated in Table XXVII There are 
no (10,5) generations and three (10,5) generations in this model, giving it three net chiral 
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TABLE XXVI: Particle content for the three-generation SU{5) « U{1) model. This model 

also has five U (1) groups and = 1 ST SUSY. 



QTY 


5[/(3) 


SU{3) 


SU{4:) 


SU{5) 


SU{7) 


2 


3 


1 


1 


1 


1 


3 


3 


1 


1 


1 


7 


3 


3 


3 


1 


1 


1 


1 


1 


3 


1 


1 


7 


1 


1 


3 


1 


1 


1 


1 


1 


3 


1 


1 


7 


2 


1 


3 


1 


5 


1 


6 


1 


1 


6 


1 


1 


2 


1 


1 


4 


1 


1 


1 


1 


1 


4 


5 


1 


2 


1 


1 


1 


5 


1 


2 


1 


1 


1 


1 


21 


6 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


21 


3 


1 


1 


1 


5 


1 


2 


1 


1 


4 


1 


1 


1 


1 


1 


4 


5 


1 


2 


1 


3 


1 


5 


1 


1 


1 


3 


1 


1 


7 


2 


1 


3 


1 


1 


1 


1 


3 


3 


1 


1 


1 


1 


3 


1 


1 


To 


1 


2 


3 


1 


1 


5 


1 


1 


3 


1 


1 


1 


7 


2 


3 


1 


1 


1 


1 
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TABLE XXVII: Observable sector matter states without hidden sector charges for the 

three-generation SU{5) (g) U{1) model. 



QTY 


SU{3) 


SU{3) 


5[/(4) 


SU{5) 


SU{7) 


2 


1 


3 


1 


5 


1 


1 


1 


1 


4 


5 


1 


2 


1 


1 


1 


5 


1 


3 


1 


1 


1 


5 


1 


1 


1 


1 


4 


5 


1 


2 


1 


3 


1 


5 


1 


1 


3 


1 


1 


lo 


1 


2 


3 


1 


1 


5 


1 



generations of mattei]^ However, counting the hidden sector charges as duplicates, there 
are 14 extra 5's and 8 extra 5's. Because of the numerous U{1) charges, this model is ideal 
for future U{1) and flat direction analysis. 



^Recall that the definition of barred and unbarred representations is arbitrary. 
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TABLE XXVIII: A basis vector and kij matrix row which produces a three-generation 

Left-Right Symmetric model. 



Sec 


Nr 






^34 


^56 


- 1,...,5 


r] ^ 


r] ^ 


7/ 


- 1,...,8 


V 


3 


1 











2 2 
3'-"'3 


2 
3 


2 

3 


2 

3 


^'••■'^'3' 3' 3 



Sec 





y ^'^ly ^'^ 


^ S^.-.^Sjl^ 3,. ..,6 




V 


3 


o,o,i,i||o,o,|, 1 


o,o,o,o||o,o,i,i 


"J)^)-^'-^!! 3)3)353 



A:,^,,- = (0, 1, 0, 0, 0) 



B. A Three Generation Left-Right Symmetric Model 

Presented in this section is an explicitly constructed three-generation Left-Right Sym- 
metric NAHE based model. The gauge group for this model is SU (2)^ (g) SU (3)^ SU (5) ® 
5*0(10) (S) U{iy, and it has N=l ST SUSY. The basis vectors for this model are given 
in Table [XXVIII[ The particle content of this model is presented in Table XXIX and the 
observable matter is presented in Table XXX[ 



This model has three net generations of quarks, but no net generations of anti-quarks. 
Additionally there are thirty left- and right-handed lepton doublets. Other exotics include 
a quark triplet with left- and right-handed isospin, eight quark and ten anti-quark triplets 
without isospin. Thus, this model is not a favorable candidate for a quasi-realistic three- 
generation model. It does serve as a proof of concept that three generation models can be 
built with single-layer extensions to the NAHE set, however. 



VII. CONCLUSIONS 

The statistics presented in this paper make it clear that the NAHE set does serve its 
intended purpose as a basis for quasi-realistic WCFFHS models at a statistical level. Three 
generation models were constructed from order-3 extensions to the NAHE set. A summary 



of the GUT group analysis is presented in Table XXXI Two three-generation models were 



discussed - a Aipped-SU (5) model and a Left-Right Symmetric model. While they did have 
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TABLE XXIX: The particle content of the three-generation Left-Right 
This model also has 7 t/(l)'s and = 1 ST SUSY. 



Symmetric Model. 



1 I 


bU [Z)l 


bU{Z)ji 


bU\6)c 


DU [o) 


DU [0) 


DU(1(J ) 


1 


2 


2 


3 


1 


1 


1 


2 


2 


1 


3 


1 


1 


1 


2 


2 


1 


1 


3 


1 


1 


3 


2 


1 


1 


1 


1 


1 


3 


2 


1 


1 


1 


5 


1 


1 


2 


1 


1 


3 


1 


1 


1 


2 


1 


3 


1 


1 


1 


2 


1 


2 


1 


3 
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TABLE XXX: The observable matter content of the three-generation Left-Right 

Symmetric Model. 



QTY 


SU{2)l 


SU{2)r 
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the requisite number of chiral matter generations, there were several unfavorable properties 
in both models that prevented them from being considered quasi-realistic. They are a proof 
that three-generation models with geometric interpretations can be built with order-3 basis 
vector extensions. 

The distributions of ST SUSYs across the GUT group subsets remained largely the same, 
save the Eq models, which displayed a greater statistical tendency for ST SUSY enhance- 
ments. Further examination revealed that all of the exceptional groups, as well as SU{9) 
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TABLE XXXI: A summary of the GUT group study with regard to the number of chiral 

fcrmion generations in the NAHE set investigation. 



GUT 


Net Chiral Generations? 


Three Generations? 


02L1 SO{W) 


Yes 


No 


02L1 Pati-Salam 


No 


No 


03L1 Ee 


Yes 


No 


03L1 50(10) 


Yes 


No 


03L1 SU{5) ® U{1) 


Yes 


Yes 


03L1 Pati-Salam 


Yes 


No 


03L1 L-R Symmetric 


Yes 


Yes 


03L1 MSSM 


Yes 


Yes 



and SU{12), display more enhanced ST SUSY models for the NAHE + 03L1 data set. It 

— * 

was also shown that the presence of the S did not significantly impact the gauge content. 
However, the matter content of the models without S is affected, as the S sector produces 
states other than SUSY partners. 
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